| Name | Version | Summary | date |
| the-convergence |
0.1.4 |
API Optimization Framework powered by evolutionary algorithms, multi-armed bandits, and agent societies |
2025-10-25 21:37:06 |
| security-verifiers-utils |
0.1.1 |
Shared utilities for Security Verifiers RL environments |
2025-10-24 16:54:48 |
| ctx-bandits-mcmc |
1.0.1 |
Feel-Good Thompson Sampling for Contextual Bandits: a Markov Chain Monte Carlo Showdown |
2025-10-23 20:41:22 |
| eldengym |
0.3.1 |
A Gymnasium-compatible reinforcement learning environment for Elden Ring |
2025-10-22 10:46:09 |
| verifiers |
0.1.6 |
Verifiers: Environments for LLM Reinforcement Learning |
2025-10-21 01:06:32 |
| orca-gym |
25.10.0 |
OrcaGym Core - Cloud-native robotics simulation platform compatible with Gymnasium API |
2025-10-20 11:22:36 |
| rlgym-tools |
2.3.12 |
Extra tools for RLGym. |
2025-10-12 14:00:24 |
| collectivecrossing |
0.1.3 |
A multi-agent reinforcement learning environment for probing the overlap of MARL and social sciences |
2025-10-10 16:51:04 |
| mettagrid |
0.2.0.25 |
A fast grid-based open-ended MARL environment |
2025-10-09 21:05:31 |
| gym-mcp-server |
0.1.0 |
Expose any Gymnasium environment as an MCP server |
2025-10-08 05:28:54 |
| toolbrain |
0.1.3 |
A framework for training LLM-powered agents to use tools more effectively using Reinforcement Learning |
2025-10-07 08:37:18 |
| rusty-runways |
2.0.4 |
Python bindings for Rusty Runways |
2025-09-13 13:53:38 |
| DroneTSP |
1.0.1 |
Drone TSP Gymnasium environment and wrappers |
2025-09-09 14:54:28 |
| copilotcloud-rl |
0.1.1 |
Python client for CopilotCloud Reinforcement Learning context API |
2025-09-03 13:33:11 |
| league-of-legends-decoded-replay-packets-gym |
0.1.2 |
A Gymnasium environment for League of Legends decoded replay packets, enabling esports research, AI development, and gameplay analysis. |
2025-09-03 04:06:53 |
| xtrade-ai |
1.2.0 |
A comprehensive reinforcement learning framework for algorithmic trading |
2025-09-01 18:09:19 |
| mmrl |
0.1.7 |
Market Making RL - simulation, experiments, and CLI |
2025-08-31 19:44:19 |
| nipd-framework |
1.0.0 |
Network Iterated Prisoner's Dilemma Framework for Multi-Agent Learning |
2025-08-30 23:39:45 |
| satquest |
0.1.2 |
A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs |
2025-08-30 19:51:50 |
| fragaria |
0.1.2 |
Advanced Chain of Thought (CoT) Reasoning API with Reinforcement Learning (RL) |
2025-08-28 21:24:46 |